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A Method for Mapping the Active Sites Bound by Enzymesthat Covalently Modify 

Substrate Molecules 

Field of Invention 

This invention is immediately relevant to many medical fields including inflammation, 
autoimmunity, transplantation, cancer, anti-microbiais ? virology, metabolic disease and 
allergy. Methods to identify selective substrates of specific enzymes are indicated, based on 
the detailed mapping of the substrate binding site using combinatorial peptide libraries. 
These enzymes can be any molecule that covalently modifies its physiological substrate 
target, examples of which include, but are not limited to, protein kinases, protein 
phosphatases, acetyiases and ribosylases. Tnese derived substrate-based compounds can 
serve as a basis for further medicinal chemistry development of selective enzyme 
inhibitors. The identification of short peptidic substrates using this methodology will also 
allow for the rapid development of high throughput screens for compound screening 

Background to Invention 

The mapping method is exemplified using members of the protein kinase enzyme familv, 
but this method is applicable to other covalently modifying enzymes. 

Phosphate transfer (phosphorylation) is the most common form of covalent protein 
modification used by cells. Protein kinases are the enzymes that catalyse the transfer of the 
y-phosphate from adenosine triphosphate (ATP) to an amino acid residue (usually tyrosine, 
threonine, serine or histidine) on a substrate molecule. Approximately 400 kinases are 
currently known, and it is likely that this number will increase considerably in the next few 
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years as more information from gene sequencing databases becomes available. 
Functionally, these molecules are intracellular enzymes that play key roies in cell srowth, 
differentiation and inter-cell communication. Aberrant protein kinase activity has been 
implicated in many disease states including several forms of cancer and severe -combined 
immunodeficiency disease (Barker et al 1997; Lehtola et al; Aipaia et al 1994; Elder et al 
1995; Roinnan, 1995). Similarly, activation of protein kinase activity within mononuclear 
cells is required to drive the cytokine production which underlies many autoimmune 
diseases (Lee et al, 1994). Thus, inhibitor compounds capable of specifically inactivating 
certain critical kinases may have considerable therapeutic benefit in a number of clinical 
diseases. 

All tyrosine and serine / threonine protein kinases have a region of approximately 300 amino 
acids known as the catalytic subunit which has evolved from a common ancestor kinase 
(Hanks and Quinn, 1991). Crystal structure determination of several kinases has shown that 
they all have a common bi-lobal structure (Wilson et al, 1996; Zhang et al, 1994; Xu et al, 
1997). The ammo-terminal part of the subunit encodes a small lobe responsible for the 
binding of ATP, whereas the carboxy-terminai residues encode a larger lobe important for 
protein substrate binding. In the tertiary structure of the active kinase, both the_ATP and 
the protein substrate binding sites are brought together allowing transfer of the ATP y- 
phosphate to the amino acid acceptor on the protein substrate. The protein/peptide binding 
groove stretches across the face of the large lobe between two a-helices and under the 
small lobe. This groove therefore contains the residues important for defining the substrate 
specificity of the kinase. 

Many protein kinases are arranged in kinase cascades within the cell, providing the ability 
for signal amplification in post-transduction pathways. This amplification relies on the 
upstream kinase specifically activating its downstream partner. For this reason, protein 
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kinases have developed remarkable substrate specificities which prevent unwanted 
crosstalk between different kinase cascades. Pais substrate specificity can be exploited in' 
the design of selective protein kinase inhibitors. 

A technique has recently been developed by L. Cantiey's laboratory to provide, consensus 
peptide protein kinase substrate information (WO 95/13323, Songyang et al, 1994). First, a 
degenerate library of peptides with a phospho-accsptor such as tyrosine or serine'threonine 
flanked by amino acids on each side is synthesised. A preferred number of degenerate 
residues on each side of the phosphorylation site is four (corresponding to positions -1, -2, - 
3, -4, -rl, -t-2, -r3, +4) relative to the phosphoryiated residue. Thus the library consists of 
peptides having a length of nine amino acids. The library is then phosphotylated by the 
protein kinase of interest and phosphoryiated peptides isolated from the non- 
phosphorylated peptides by DEAE-sephacel and ferric chelation chromatography. The 
phosphopeptide mixture is then sequenced and the frequency of each amino acid at every 
position assessed to give a preferred substrate sequence. These studies have yielded 
consensus substrate information, but do not allow a detailed analysis of particular 
preferences for neighbouring residue interactions as pools of peptides are examined. 
Furthermore, this type of analysis may not show up rare good substrates which-Could be 
hidden by the presence of numerous poor substrates in the peptide pool. By this method 
individual peptides can never be identified as individual sequences, the result is that an 
average picture of substrate specificity is reached Part of the problem is that each 
individual peptide is represented at such a low level, and many inevitably will not even be 
present. The results from Cantley's method do not represent individual peptides but a 
consensus picture of protein kinase substrate specificity. 

Filamentous phage expressing gene El-linked degenerate peptide sequences have also been 
used to generate substrate information (Schmitz et a/, 1996; Dente et al y 1997), however 
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this method is labour intensive and does not allow the use of unnatural amino acids or 
peptidomimetics. Substrate information can also be obtained from knowledse of the 
physiological kinase substrates. This approach is limited and previous attempts to utilise 
this information for the design of successful therapeutic cell permeable protein kinase 
inhibitors has failed (Kemp et aL 1991). 

We believe that identification of detailed substrate characteristics can intimately map the 
substrate binding groove and provide information that can lead to the design of enzyme 
inhibitor molecules. For the reasons described above, there are no current methods for 
obtaining this information- Therefore, we have invented a method of using small molecules, 
in a self deconvo luting library format, to probe a larger active site by positional sc annin g of 
a target group. This method is rapid, not labour intensive and results in the identification of 
discrete sequences. 

Summary of Invention 

This invention provides for the active site mapping of enzymes which catalyse-covalent 
modification including, but not limited to phosphorylation, acylation, dephosphorylation in 
which a fixed residue (hereafter known as the catalytic , residue) such as a tyrosine, serine, 
threonine, histidine, aspartic acid residue or any other residue containing an appropriate 
side chain is modified. The method of the invention has an additional level of complexity 
over and above that of the self-deconvo luting libraries described in W097/42216 (the 
content of which is incorporated herein by reference, where legally permissible). 

* 

This involves making a library of smaller libraries (referred to as sub-sets) where a fixed 
residue is moved stepwise through the sequence of amino acids or other groups (such as 



SUBSTITUTE SHEET (RULE 26) 



WO 99/23109 



PCT/GB98/03259 



5 

peptidomimerics [any compound that can be added to the substrate or inhibitor chain]). The 
result is that in each sub-set of the library the fixed residue is found in a different position! 
For example, in a library using four variable positions, five sub-sets in each library have to 
be made, ZXXXX, XZXXX, XXZXX XXX2X and XXXXZ where Z is the fixed residue 
and X are the four variable residues. We recognise that there may be a need in certain 
circumstances for further invariant residues, however these would occupy fixed positions 
and would not affect either the scanning or the seif-deconvo luring of the libraries. Tne 
invariant residue(s) might be fixed in position relative to the modifiable residue Z or may 
be fixed in position relative to the overall motif sequence. Additional fixed residues can be 
added if desired, or one of the variable residues can be made invariant. In the later case the 
library would be a small part of the libraries described here. Cases where it is desirable to 
include one or more fixed residues include libraries required to look at enzymes which 
always require another invariant residue in another position. However, in cases where two 
fixed residues are required, and they are both modified, it can be desirable to include this 
residue in one of the variable positions (i.e. make it one of the residues chosen in a variable 
position). The reason for this is that the sequence of events (the order in which the two 
residues become modified) can then also be probed by this scanning library technique. In 
this case it may also be beneficial to make an additional library in which the fixed-tesidue is 
not present at all, corresponding to the library XXXX. We would therefore have a library of 
six sub-sets. These modifications are within the scope of the invention and would be 
recognised by someone skilled in the an. 

It can be readily seen that by combining the data from each library sub-set, the residues 
from -4 to +4 either side of the catalytic residue can be mapped: 

« 

A-B-C-D-Z 
B-C-D-Z-E 
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C-D-Z-E-F 
D-Z-E-F-G 
Z-E-F-G-H 

The mapped sequence would therefore be A-B-C-D-Z-E-F-G-H. 

The above example using 5 subsets of libraries of peptides of 5 amino acids allows the 
mapping of a sequence of 9 amino acids. In general one could carry out the invention usins 
n subsets of n-mer peptides so as to provide mapping data for the residues from -{n-1) to 
-Kn-1) either side of the active site. Thus in general the length of the mapped sequence 
would be (2n)-l. 

Where the residue type at any given position relative to the fixed residue is similar in 
different subsets, the data can be used in an additive manner. For example, if an aromatic 
residue is required adjacent to the fixed residue, then any sequences which contain this 
feature in any of the library subsets can be considered in an additive way. 

In this invention there is no need to separate modified from unmodified sequencSs'because 
of the self deconvo luting nature of the library. The assay screen produces a series of hits, 
the patterns of which reveal the unique sequences in each well. This enables a pattern of 
substrate preferences to be determined for any enzyme. 

The unique sequences obtained using this invention can be used to provide substrates for 
high throughput assays and provide detailed information about the active site to aid rational 
drug design. 



SUBSTITUTE SHEET (RULE 26) 



WO 99/23109 



PCT/GB98/03259 



/ 

This invention can also be used as an inhibitor library to screen against known modify ins 
enzymes where a known substrate exists and can be set up in an~assay format. Tnose skilled 
in the an would realise that by replacing the fixed residue with a suitable compound that is 
not modified an inhibitor library can be constructed. For example if a modifiable fixed 
tyrosine were to be changed to a tyrosine derivative residue that cannot be phosphoryiatecL 
such as halogenated tyrosine, dopamine, or tyrosine substituted by aromatic compounds, 
then an inhibitor library will be formed. Tois could allow the more direct identification of 
prototype inhibitors of enzymes for rational drug design. 

Use of these libraries could be extended to other systems where a defined endpoint is 
desired, but the target enzyme is unknown. Such examples could include, but are not 
limited to, bacterial lysis in growing cultures or inhibiting phosphorylation of transcription 
factors in cell lysates. 

In one embodiment of this invention the sequences identified by this method are 
considerably smaller than have previously been reported for library screens on protein 
kinase substrates, which makes them more amenable to computer modelling and drug 
design. Furthermore, this methodology provides information about the relative relationships 
between neighbouring residues of active substrates; information which is not available from 
a straightforward oriented degenerate peptide library approach used by Cantley (Songyang**^ 
et al 7 1994). Thus, this novel methodology provides a significant improvement in the 
quality of substrate based information that is achievable, in comparison to that produced 
from previously described methods. 

This invention allows data to be obtained from single peptide rankings which could be used 
to rationally design sets of enzyme inhibitor molecules which compete with the 
physiological substrate for binding to the active site of the enzyme. 
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Description of the Drawings 

Figure L Design of protein tyrosine kinase library. Each peptide consists of a biotin tag, an 
epsilon amino hexanoic acid spacer and 5 amino acids, including a phosphorylatabie 
tyrosine residue. Each of the amino acid positions A-H is varied as described. For example, 
Al-10 means that position A is varied using 10 defined amino acids. 

Figure 2. Best substrates identified by screening tyrosine library sub-sets 1 to 5 against 
ZAP-70 protein tyrosine kinase. Tne protein tyrosine kinase library described in Figure 1 
was phosphorylated for 30 minutes at 30°C using the catalytic domain of human ZAP-70. 
Peptides were captured using strepavidin-coated 96 well plates and phosphotyrosine 
detected using anti-phospho tyrosine antibody, anti mouse IgG-HRP and 
tetramethyibenzidine (see experimental methods). Best substrates were identified as those 
which gave the highest amount of phosphate incorporation. 

Figure 3. Km Determination of Biotin-£AHA-DEEDYFE(Nie) [SEQ ID NO. 3]. The 
catalytic domain of human ZAP-70 was used to phosphoryiate varying concentrations of 
peptide for 10 minutes at 30°C in the presence of JJ P-y-ATP. Peptide capture was 
performed using strepavidin filter plates, scintillation fluid added, and counting performed 
using a beta-counter (see experimental methods). Samples were assayed in triplicate. 

Description of the Invention 

This invention provides for the active site mapping of enzymes which catalyse covalent 
modification including, but not limited to, phosphorylation, acylation, dephosphorylation in 
which a residue such as a tyrosine, serine, threonine, histidine, aspartic acid or any other 
residue containing an appropriate side chain is modified. 
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Thus, the invention provides a method for determining an amino acid sequence motif or a 
pep ci do mimetic sequence motif containing an active site capable of being bound by an 
enzyme which catalyses covalent modification of a substrate molecule, comprising; 

a) contacting the enzyme with a library consisting of a number of oriented 
degenerate library subsets of molecules, each subset comprising 
unmodified degenerate motif sequences each having n residues and each 
having a modifiable residue at a different fixed non-degenerate position, 
under conditions which allow for modification of molecules which are a 
substrate for the enzyme; 

b) allowing the enzyme to modify modifiable residues in library subsets 
containing molecules having an active substrate site for the enzyme; 

c) deconvo luting the oriented degenerate library subsets of the library, in situ 
without separating modified from unmodified molecules, so as to reveal 
the sequence of any motif which has been modified by covalent binding of 
the enzyme; 

wherein each library subset is of formula (I) 

(Xaa) x Zaa (Xaa) y (I) [SEQ ID No. 1 ] 

wherein 

Zaa is a non-degenerate modifiable natural or unnatural amino acid residue or 
peptidomimetic; 

Xaa is any natural or unnatural amino acid residue or peptidomimetic; 
x and y are each independently 0 or an integer, 
(x + y) = (n-l);and 

n = an integer from 3 to 8, preferably 5. 
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This invention can be applied for instance to a protein tyrosine kinase in order to exemplify 
the technology. It provides a rapid method of identifying discrete protein kinase substrate 
sequences which allows pharmacophore generation and design of active site inhibitors. This 
invention can also be used to directly identify protein kinase inhibitor molecules. General 
formula: (Xaa) x Tyr (Xaa) y [SEQ.ED No. 2]. 

In the first exemplification of this invention, a recombinant form of the human ZAP-70 
enzyme was used in an in vitro phosphorylation reaction to phosphorylate the five substrate 
sub-libraries which scan the sequence -4 to +4 around a central tyrosine residue (Figure 1). 
Tne libraries were arranged in 96 well micro titre plate format with pools of 20 peptides in 
each microtitre well. However, those skilled in the an will realise that the library can be 
constructed on any scale. For example the library Sub-Set can be miniaturised on a "chip" 
scale or constructed on a large bulk scale depending on the requirements of the library. 

Library peptides were made with biotin tags, which allowed peptide capture on strep avidin- 
coated microtitre plates. Detection of phosphotyrosine was achieved using anti- 
phosp ho tyrosine antibody detection in an ELISA assay using tetramethyibenzidine 
substrate and recording absorbance at 450 nm. Background absorbance readings~of 0. 1 to 
0.2 were recorded while the highest substrate peptide value was 1.5. Deconvolution of the 
hit peptides was performed as described in WO 97/42216. Clear defined substrates were 
deconvo luted in library sub-sets 1 to 4, but not in 5. This probably reflects the absolute 
requirement of ZAP-70 for an amino acid residue in the -1 position. 

For the purpose of this exemplification, the peptides used were tagged with d-Biotin and a 
linker (epsilon amino hexanoic acid or some other spacing group). In principle any tag and 
linker can be used, although this invention also provides that a tag and linker does not have 
be present if mass spectroscopy, for example, is used to identify the peptide hits. The 
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purpose of the tag is solely to enable capture of all of the peptides (whether modified or 
not) so that excess reagents can be washed away. The reporting systems to detect peptide 
modification can include, but are not limited to. antibody recognition, radioactive assay or 
mass spectroscopy. 

In the library used to exemplify the invention, a Biotin tag was chosen because we believed 
that this would give improved results. The reasoning for this choice of tag was because of 
the high level of positively charged groups on the enzyme in the area in which the tag sits. 
This charged area would cause unfavourable interactions with tags more commonly use by 
others in the field, such a poly- lysine or poiy-arginine. We would expect this reasoning to 
be applicable to any enzyme which binds a highly negatively charged molecule such as but 
not limited to ATP, close to the peptide binding site. 

Tags are preferably non peptidic, with as little charge, either positive or negative, as 
possible. Biotin is a good example of this. The aim is to minimise the interactions of the 
tag with the protein so the resultant hits are largely due to the binding of the peptides rather 
than reflecting the binding of the tag. Pne best method of all if this argument is applied to 
its logical conclusion would be to >not use a tag at all and use mass spectroscopy tOrfcientify 
the peptides. However, currently this approach is of limited value due to the time taken to 
run and analyse a library of the size used here to exemplify the invention. 

The results obtained from the library screen clearly demonstrated amino acid residues 
preferred by the protein kinase at each of the -4 to +4 sites (Figure 2). The 5 mer peptides 
overlapped to give information on amino acid preference at each of the binding positions -4 
to +4. To confirm this a consensus peptide, Biotin-sAHA-DEEDYFE(Nle) [SEQ ID No. 
3], representing the best -4 to +4 amino acids was made and tested as a substrate (Figure 3). 
This substrate gave a Km against ZAP-70 of 15.79 jiM, which is better than the best ZAP- 
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70 substrate described in the literature, a longer peptide of 14 amino acids with a tas of 3 
arginines and a K m of 29 jiM (Wandenburg et al, 1996). 

In the second application of this invention, a recombinant form of the human Syk enzyme 
was used in an in vitro phosphorylation reaction to phosphorylate the five substrate sub- 
libraries which scan the sequence -4 to h-4 around a central tyrosine residue, as previously 
performed for the ZAP-70 library. Detection of phosphotyrosine was achieved using anti- 
phosphotyrosine antibody detection in an ELISA assay using tetramethylbenzidine 
substrate and recording absorb ance at 450 am. Background absorb ance readings of 0.10 
were recorded while the highest substrate peptide value was 1.46. Deconvolution of the hit 
peptides was performed as described in WO 97/42216. Clear defined substrates were 
deconvoluted in all library sub-sets. 

In the third application of this invention, a recombinant form of the human CSK enzyme 
was used in an in vitro phosphorylation reaction to phosphorylate the five substrate sub- 
libraries which scan the sequence -4 to +4 around a central tyrosine residue, as previously 
performed for the ZAP-70 library. Detection of phosphotyrosine was achieved using anti- 
phosphotyrosine antibody detection in an ELISA assay using tetramethylbenzidine 
substrate and recording absorb ance at 450 nm. Background absorb ance readings of 0.04 
were recorded while the highest substrate peptide value was 0.22. Deconvolution of the hit 
peptides was performed as described in WO 97/42216. Clear defined substrates were 
deconvoluted in all library sub-sets. 

In the fourth application of this invention, a recombinant form of the Abelson murine 
leukaemia virus protein tyrosine kinase v-Abl was used in an in vitro phosphorylation 
reaction to phosphorylate the library sub-set 4 which scans the sequence -1 to +3 around a 
zero position tyrosine residue, as previously performed for the ZAP-70 library. Detection of 
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phosp ho tyrosine was achieved using anti-phosp ho tyrosine antibody detection in an ELISA 
assay using tetramethyibenzidine substrate and recording absorb ance at 450 am.' 
Background absorbance readings of 0.1 1 were, recorded while the highest substrate peptide 
vaiue was 0.32. Deconvolution of the hit peptides was performed as described in WO 
97/42216. Clear denned substrates were deconvo luted in the library sub-set. 

In the fifth application of this invention, the invention was used to map the substrate 
specificiry of a protein serine or serine/threonine kinase (which include I-kappa B kinase 
beta and cAMP-dependent protein kinase [cAPK]). A protein serine or serine/threonine 
kinase enzyme was used in an in vitro phosphorylation reaction to phosphorylate the five 
substrate sub-libraries which scan the sequence -4 to -r-4 around a central serine residue. 
The library was synthesised as the protein tyrosine kinase ZAP-70 library save that the 
tyrosine fixed residues were replaced with a serine which was then scanned through the five 
sub-libraries. Detection of phosphoserine was achieved using anti-phosphoserine antibody 
detection in an ELISA assay using tetramethyibenzidine substrate and recording absorbance 
at 450 nm. Deconvolution of the hit peptides was performed as described in WO 97/42216. 

General formula: (Xaa) x Ser (Xaa) y [SEQ ID No. 4]. 
Library Sub-Set 1 Xaa-Xaa-Xaa-Xaa-Ser 
Library Sub-Set 2 Xaa-Xaa-Xaa-Ser-Xaa 
Library Sub-Set 3 Xaa-Xaa-Ser-Xaa-Xaa 
Library Sub-Set 4 Xaa-Ser-Xaa-Xaa-Xaa 
Library Sub-Set 5 Ser-Xaa-Xaa-Xaa-Xaa 



Likewise the Library Sub-Sets can be synthesised for the piapping of threonine kinases by 
the synthesis of a library containing the threonine residue to allow phosphorylation by 
enzymes recognising this residue. 
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It will be realised by those skilled in the art that the replacement of the recognition residue 
such as the tyrosine, or serine, with a residue that is covalently modified by the enzyme to 
be mapped allows the active site of any such enzyme to be determined according to the 
invention. 

The invention will now be described by reference to the following examples. 
Example 1 

In this example the invention was used to map the active catalytic site of ZAP-70, a protein 
kinase enzyme that catalyses the phosphorylation of a tyrosine residue. The example 
illustrates the synthesis of a number of compounds, and their use as a sub-set library for the 
mapping of the enzyme so as to allow the subsequent identification and synthesis of single 
specific substrates for the enzyme. 

Synthesis of Peptide Compounds. 

Preparation of Crown Assembly 

The peptide compounds were synthesised in parallel fashion using Fmoc-Rink-DA/MDA 
derivatised gears (ex Chiron Mimotopes, Australia) loaded at approximately 1.6 per 
crown. Prior to synthesis each crown was connected to its respective stem and slotted into 
the 8 x 12 stem holder. Coupling of the amino acicis employed standard Fmoc amino acid 
chemistry as described in 'Solid Phase Peptide Synthesis!, E. Atherton and R.C. Sheppard, 
IRL Press Ltd, Oxford, UK, 1989. 
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Removal of N -Fmoc Protection 



A 250 ml solvent resistant bath is charged with 200 ml of a 20% piperidine/DMF solution. 
The multipin assembly is added and deprotection allowed to proceed for 30 minutes. Tne 
assembly was removed and excess solvent removed by brief shaking. The assembly is then 
washed consecurively with (200 ml each), DMF (5 minutes) and MeOH (5 minutes, 5 
minutes, 5 minutes) and left to air dry for 15 mi nutes. 



Quantitative UV Measurement of Fmoc Chromophore Release 

A 1 cm path length UV cell is charged with 1.2 ml of a 20% piperidine/DMF solution and 
used to zero the absorbance of the UV spectrometer at a wavelength of 290nm. A UV 
standard is then prepared consisting of 5.0 ma Fmoc-Asp(OBut)-Pepsyn KA (0.08 mmol/g) 
in 3.2 ml of a 20% piperidine/DMF solution. This standard gives Abs :9 o = 0.55-0.65 (at 
room temperature). An aliquot of the multipin deprotection solution is then diluted as 
appropriate to give a theoretical Abs 29 o = 0.6, and this value compared with the actual 
experimentally measured absorbance showing the efficiency of previous coupling reaction. 

Coupling Of Standard Amino Acid Residues 

Coupling reactions were performed by charging the appropriate wells of a polypropylene 
96 well plate with the pattern of activated solutions required during a particular round of 
coupling. Gear (approx 1.6 umole) standard couplings were performed in DMF (300 ul). 

Coupling of an Amino-acid Residue To Appropriate Well 
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Whilst the multipin assembly is drying, the appropriate N-Fmoc amino acid pfp esters (10 
equivalents calculated from the loading of each crown) and HOBt (10 equivalents) required 
for the particular round of coupling are accurately weighed into suitable containers. 
Alternatively, the appropriate N-Fmoc amino acids (10 equivalents calculated from the 
loading of each crown), desired coupling agent e.g. HBTU (9.9 equivalents calculated from 
the loading of each crown) and activation e.g. HOBt (9.9 equivalents calculated from the 
loading of each crown), NMM (19.9 equivalents calculated from the loading of each 
crown) were accurately weighed into suitable containers. 

The protected and activated Fmoc amino acid derivatives were then dissolved in DMF (300 
1 for each gear e.g. for 20 gears, 20 x 10 eq. x 1.6 jimoles of derivative would be dissolved 
in 10 ml DMF). The appropriate derivatives were then dispensed to the appropriate wells 
ready for commencement of the 'coupling cycle'. As a standard, coupling reactions were 
allowed to proceed for 6 hours. The coupled assembly was then washed as detailed below. 

Coupling of d-Biotin acid to pins 

d-Biotin (lOeq), l-hydroxybenzocriazole.H 2 0 (lOeq), BOP (9.95eq) and NM^r (19.9eq) 
were dissolved in DMF (0.3mL per well) and agitated for 2 minutes. 300 p.L of solution 
was dispensed to each well of a 96- well polypropylene plate. Tne gears were then added to 
the solution and left for 24 hours. Fresh solution was made up, the gears washed in DMF 
for 5 minutes and then added to the fresh coupling mixture and left a further 24 hours. 

The pin assembly was removed from the plate, shaken free of excess liquid then immersed 
in DMF (200mL) for 5mins. The assembly was again shaken then immersed in MeOH 
(200mL, 3 x 5mins) and allowed to air dry. 



SUBSTITUTE SHEET (RULE 26) 



WO 99/23109 



PCT/GB98/03259 



17 

Washing Following Coupling 

If a 20% piperidine/DMF deprotection is to immediately follow the coupling cycle, then the 
multipin assembly is briefly shaken to remove excess solvent washed consecutively with 
(200 ml each), MeOH (5 minutes) and DMF (5 minutes) and de-protected (see 6.2). If the 
multipin assembly is to be stored or reacted further, then a full washing cycle consisting 
brief shaking then consecutive washes with (200 ml each), DMF (5 minutes) and MeOH (5 
minutes, 5 minutes, 5 minutes) is performed. 

Following these general methods, the peptide libraries shown in Figure 1 were sequentially 
assembled by applying the appropriate coupling procedure at the correct cycle during 
synthesis. 

Acidolytic Mediated Cleavage of Pep tide-Pin Assembly 

Acid mediated cleavage protocols were strictly performed in a fume hood. A polystyrene 
96 well plate (1 ml/well) was labelled, then the tare weight measured to the nearest mg. 
Appropriate wells were charged with a trifluoroacetic acid/triisopropylsilane (95:5, v/v, 600 
yd) cleavage solution, in a pattern corresponding to that of the multipin assembly to be 
cleaved. 

The multipin assembly is added, the entire construct covered in tin foil and left for 2 hours. 
The multipin assembly in then added to another polystyrene 96 well plate (1 ml/well) 
containing trifluoroacetic acid/triethylsilane (95:5, v/v, 600 ^1) (as above) for 5 minutes. 

Work up of Cleaved Peptides 
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The primary polystyrene cleavage plate (2 hour cleavage) and the secondary polystyrene 
plate (5 minute wash) were then placed in the SpeedVac and the solvents removed 
(minimum drying rate) for 90 minutes. 

The contents of the secondary polystyrene place were transferred to their correspondins 
wells on the primary plate using an acetonitrile/water/acetic acid (50:45:5, v/v/v) solution 
(3 x 150 jal) and the spent secondary plate discarded. 

Analysis of Products 

A 5uL aliquot from each well is diluted to 100 \il with 0.1% aq. TFA, then a lOuL aliquot 
from this plate diluted with a further 100 jal 0.1% aq. TFA. The double diluted plate was 
analysed by HPLC-MS. 

Final Lyophilisation of Peptides 

The plate was covered with tin foil, held to the plate with an elastic band. A pin prick was 
placed in the foil directly above each well and the plate placed at -80°C for 30 minutes. The 
plate was then lyophilised on the 'Heto freeze drier' overnight. Finally, the dried plate was 
weighed. The total cleaved peptide was quantified (by weight) and the average content of 
each peptide calculated. Since all the peptides present have originated from the same 
peptide-pin assembly, cleaved under identical conditions, it is reasonable to assume that the 
contents of each well are roughly equimolar. 

Protein kinase cloning, expression and purification 
Polymerase chain reaction (PCR) and downstream cloning 



SUBSTITUTE SHEET (RULE 26) 




WO 99/23109 PCT/GB98/03259 



19 

The coding sequence for human ZAP-70 amino acid 306-615 was amplified from Jurkat T 
cell cDNA by PCR (2 minutes at 94°C, followed by 35 cyQles of 15 seconds 94°C, 30 
seconds 65°C 7 2 minutes 72°C and a final single 5 minute 72°C incubation) using the 
primers: 

5' CCGGGATCCGCCATGCCCATGGACACGAGCGTGTAT 3 ' [SEQ ID No. 5] 

5' GGGGGATCCTCAGTGGTGGTGGTGGTGGTGGGCACAGGCAGCCTCAGC 
CTTCTGTGT 3 ' [SEQ ID No. 6] 

The PCR amplicon was cloned into the Bam HI site of pUC19 and sequence confirmed 
using M13-20 and reverse primers on an Applied Biosystems Prism 310 sequencer as 
described by manufacturer. The Bam HI ZAP-70 insert was excised from the sequencing 
vector and ligated into the Baculoviral transfer vector p AcUW5 1 (Pharmingen). 

Generation of ZAP-70 enzyme using baculovirus 

Homologous recombination with wild type baculoviral DNA was then performed in Sf9 
insect cells and viral supernatant harvested. Plaque purified virus was exposecf-to several 
viral amplification steps then used at a titre of 3xl0 9 PFU/ml to infect 3 1 IxlO 6 cells/ml Sf9 
cells at an MOI of 10 in an Applecon bioreactor using 60% dissolved oxygen. Cells were 
harvested 3 days post infection. 

Protein purification 

The infected cell pellet was lysed in 50 mM Tris-HCl, pH 7.5, 0.15 M NaCl, 25% sucrose, 
1 mM 4-nitrophenol phosphate, 1 mM sodium orthovanadate and protease inhibitors. 
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Following homogenisation using a dounce pestle B, the cleared lysate was loaded onto a 
cobalt-sepharose column. After column washing with lysis buffer, elution was performed 
with an imidazole gradient and ZAP-70 fractions identified by protein kinase activity 
against the peptide substrate Lys-Lys-Lys-Lys-Ala-Asp-Glu-Glu-Asp-Tyr-Phe-Ile-Pro-Pro- 
Ala as described in Casnellie et al 9 1991 

Library Screening 

Library peptides were phosphorylated in pools of 20 peptides at a final concentration of 1 
jiM total peptide in 50 mM HEPES, pH 7.5, 0.1% Triton X-100, 100 jj.M ATP, 10 mM 
MnCh, 1 mM DTT and 0.2 mM sodium orthovanadate for 30 minutes at 30°C These 
reaction mixtures were then stopped using 1 00 mM EDTA, 6 mM adenosine, transferred to 
strepavidin-coated microtitre plates and allowed to bind for 30 minutes at 20°C. 
Unincorporated reaction products were washed from the plate using PBS/0.1% Tween 20 
then plate incubation performed with anti-phosphotyrosine antibody (Sigma mouse 
monoclonal clone PT66) in 2% BSA/PBS/Tween for 1 hour at 20°C. Unbound antibody 
was removed by plate washing using PBS/Tween then incubation performed with rabbit 
anti mouse-HRP (Amersham) for a further 1 hour. HRP detection was then pe rfor med with 
tetramethylbenzidine (TMB) substrate and measuring absorbance at 450 nM using a 
spectrophotometer (Dynex MRX). 

The best substrates were identified as those which gave the highest amount of phosphate 
incorporation. The library subsets were deconvoluted according to the teaching of 
W097/42216: this gives an immediate determination of the unique sequence of any 
phosphorylated motif without the need for further synthesis or sequencing. (Figure 2 [SEQ 
EDNos. 7,8,9,10]). 
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Peptide K m determination 

Biotin-tagged peptides were phosphorylated at varying concentrations in 50 mM HEPES, 
pH 7.5, 0.1% Triton X-100, 200 jiM ATP, 10 mM MnCh, 1 mM DTT and 0.2 mM sodium 
ortho vanadate using 0.5 jiCi 3j P-y-ATP for 10 minutes at 30°C. The reactions were stopped 
using 2 M guanidine hydrochloride, diluted 1 in 10 in water then 5 pi reaction mix spotted 
onto SAM™ titre plates (Promega). Unincorporated reaction products were washed as 
described by manufacturer then 20 p.1 scintillation liquid added and plate counted on a 
Packard TopCount beta-counter. K m was calculated using a non-linear one site hyperbola 
model (ATP was added in excess to negate influence of the ATP binding site on the 
substrate site kinetics). (Figure 3). 



In this example the invention was used to map the active catalytic site of Syk, a protein 
kinase enzyme that catalyses the phosphorylation of a tyrosine residue. The example 
illustrates the positional scanning of the sub-set libraries for the mapping of the enzyme, 
and their use so as to allow the subsequent identification of the preferred substrates of the 
enzyme catalytic site. ~- 

The mapping and assessment of the catalytic site was performed as detailed in Example 1. 
The substrate preferences were deconvo luted as detailed in WO 97/42216 and are detailed 
below. 



Example 2 



Library Sub-Set 1 
Library Sub-Set 2 



Asp-Glu-Glu-Asp-Tyr [SEQ ID No. 11] 

Asp-Glu-Glu-Tyr-Asp [SEQ ID No. 12] 
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Library Sub-Set 3 
Library Sub-Set 4 
Library Sub-Set 5 



Asp-Glu-Tyr-GIu-Asp [SEQ ID No. 13] 
Asp-Tyr-Glu-Glu-Val [SEQ ID No. 14] 
Tyr-Ser-Ile-IIe-NIe [SEQ ID No. 15] 



Example 3 



In this example the invention was used to map the active catalytic site of CSK. The subset 
library was used to scan the enzyme active site so as to allow the subsequent identification 
and synthesis of the preferred specific substrates for the enzyme as listed below. 



In this example the invention was used to map the active catalytic site of v-Abl, a protein 
kinase enzyme that catalyses the phosphorylation of a tyrosine residue. For this enzyme 
only the Library Sub-Set 4 (i.e. X-Tyr-X-X-X according to SEQ ID No. 2) was scanned 
with the enzyme. The active site substrate recognition substrate for this enzyme for this 
Sub-Set was Serine-tyrosine-phenylalanine-histamine-glutamine [SEQ ID No. 21]. 



Library Sub-Set 1 
Library Sub-Set 2 
Library Sub-Set 3 
Library Sub-Set 4 
Library Sub-Set 5 



Asp-Glu-GIu-Glu-Tyr [SEQ ID No. 16] 

Asp-Glu-Glu-Tyr-Phe [SEQ ID No. 17] 
Asp-Glu-Tyr-His-Asn [SEQ ID No. 18] 
Asp-Tyr-His-Leu-Phe [SEQ ID No. 19] 
Tyr-Pro-Ile-Glu-Val [SEQ ID No. 20] 



Example 4 
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